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Introduction 

The genetic code of the last common ancestor (LCA), or a minor variant of it, is 
présent in ail species. Its origin, in the pre-LCA era, has remained an enigma for four 
décades. My analysis reveals that ail régularises dantiied in ccde str_. >:tu r e correlate 
strongly with path-distances in amino acid synthesis. The code accordingly evolved by 
adding amino acids as they appeared. du ring the growth of synthesis pathways 
outwardfrom central metabolism. Codon assignments in the 'universal code' were 
found to dérive from ancient transitional codes, formed deep within the pre-LCA era. 

Design of Study 

• Amino acid path-distsncas were svaluated as the number of reaction steps, from 
citrate cycle, required for synthesis. 

• Distinct transition codes were identified and used to reconstruct code évolution. 

• The path-distance model was then validated by showing that it unifies over twenty 
diverse structural regulartties îdenl ned in the genetic code and pre-LCA proteins. 

Amino Acid Synthesis Pathways 


Amino Acid and Codon Distribution on Synthesis Pathways 


• Amino acids 
form on ancient 
pathways 
catalyzed by 
fifty pre-LCA 
enzymes. 

• The twenty 

acids subdivide 
into four families 
with precursors 
OAA r aKG r Pyr r 
and PEP. 


Path-length contour 
[7] No. pre-LCA enzymes in paths 
[CGN] Codons. N, any base, R, purine 

Y pyrimidine 
RCC Reductive citrate cycle 
RPC Reductive pentose cycle 
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Path with multianionic components 

Scaled amino acid synthesis path 

■■■ Unscaled segment of path 

Segment with alpha-amino acids 

CT Central trunk 


Precursor-Product Amino Acid Pairs hâve Time-Ordered Codons 
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• Extensicn of amino acid synthesi 

(i) Codo-s in precursor/product a 

(ii) Mid-base and mean synihasis 

(2-7 steps] amino acids. 


pathways and code formation were linked as: 
lino acid pairs exhibit time-ordered 5'- and mid-base. 
)ath-length (L) correlate strongly among short-path 


Path-Distance Model Explains Code Structure 

. Woese (1965): NAN column tripk 

wt-iile NUN tripk 


Amino acids with short, médium, and long 
paths are chemically distinct and encoded 
differently: 

(1] Four NtV fïxers form in 2 steps (or less). 
include both anionic residues (red). and 

hâve codons solely from the NAN column. 

(2) Ten amino acids with alkyl. hydroxy and 
S-bearing side-chains form on 4-7 step 
paths. Consensus codons for 4-, 5-, and 

7-step residues are respectively from the 
NCN, NGN, and NUN column. 7 of 8 'sets 
of four' (intact code boxes] encode them. 

(3) Six basic (blue) and aromatic residues 
form on 9-14 step paths. They are 
encoded mainly by codon doublets. 

A 14-îcic :s:*pone"î a ) 'a i-cff m ccco" 
assignments over paths of 4- to 14- steps 
conforms with graduai slowing in the tempo 
évolution leading to the 'universal code'. 
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2. Taylor and Coates (1! 


tact boxes code for the 
o acids in proteins. 


EFfective path length (L) 
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Transition Codes Revealed by Amino Acid Path-Distances 


3. Perlwitz et al. (1988): Codori rnid-bas.e hôs mon ocding capacity. 


• NAN column codons for four 
'2-step' amino acids (NH 4 + 
fixers) and STOP signal are 
identified as a vestige of the 
first code. Codons of amino 
acids with 2-, 4-, 5-, and 7- 
step paths exhibit column- 

NAN-»NCN->NGN->NUN 
consistent with column-by- 
column growth of the code. 

• Early amino acids (2-7 step 
paths) acquired 7 of 8 intact 
boxes, suggesting each 
was allotted an intact box of 
codons on entering the code. 

• Latecomer amino acids (9- 
14 step paths) share 6 of 8 
subdivided boxes with an 
early amino acid (2-7 step 
path), or stop signal. This 

ne cotes they captured codons 
from early amino acids. 


coding capacity 
per codon site 


lino acids added per 
stage of code évolution 
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Conclusion 

Amino acid synthesis path length strongly influenced codon acquisition. 
Analysis of this relation reveals the gsrieik cûde e-, cives n three steps: 
(i) NH 4 + Fixers Code, with NAN column triplets, (ii) Code Expansion, and 
fiii; Oveicrn:ing oy latairo^ers. \ci only s :h ^ seneti: coce '.nive'sol 
therefore. it conser-es .es:igsi£ of ~'iï :rsnsit o" codas thaï shaped it. tRNA 
with joint cofactor (amino acid synthesis) and adaptor (translation) functions 
are implicated in linking code formation with pre-LCA path extension. 

Référence 

B. K. Davis, 2007. In Leading-Eclge HesseriQB' fi.WA Research Communications 
M. H. Ostrovskiy, Ed. New York: Nova Science, pp 1-32. 


